Draft Genome Sequences of Two Clinical Isolates of Streptococcus mutans
We report the draft genome sequences of PKUSS-HG01 and PKUSS-LG01, two clinical isolates of Streptococcus mutans from
human dental plaque. The genomic information will facilitate the study of the mechanisms of pathogenicity and evolution of
S. mutans.
Streptococcus mutans is a Gram-positive organism that is the
primary causative agent in the formation of dental caries in
humans. S. mutans metabolizes a variety of carbohydrates,
generating organic acid in dental plaque. The acidic environment
results in mineral loss on the surface of teeth and eventually causes
tooth decay (1). Furthermore, it has been reported that S. mutans is
implicated not only in infective endocarditis (2) but also in
hemorrhagic stroke (3). To better understand the molecular basis of
mechanisms of pathogenicity, degree of diversity, and genome
rearrangement within the species, we sequenced the genomes of
two strains from human dental plaque.

Whole-genome sequencing was performed using an Illumina
HiSeq 2000 platform with an insert size of 500 bp, according to
the manufacturer's instructions. Paired-end sequences were generated,
with total coverage of 315× and 310× for PKUSS-HG01 and
PKUSS-LG01, respectively. For each genome, de novo assembly was
performed using SOAPdenovo (4), and then the gaps within scaffolds
were closed with GapFiller (5). Open reading frames (ORFs) were
predicted with Glimmer 3.02 (6). Putative ribosomal binding sites
were identified with RBSfinder (7). tRNAs and rRNAs were
predicted using tRNAscan-SE (8) and RNAmmer (9), respectively.
Functional annotation was performed by searching against the
NCBI nr, Swiss-Prot (10), Clusters of Orthologous Groups
(COG) (11), KEGG (12), and InterProScan (13) databases.
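
As a brief aside on the coverage figures above, fold coverage is conventionally the total number of sequenced bases divided by the genome length. The sketch below shows that arithmetic only. The read count, read length, and genome length are hypothetical placeholders chosen for illustration; none of them is reported in this paper.

```python
def fold_coverage(n_reads, read_length_bp, genome_length_bp):
    """Return sequencing depth as total sequenced bases / genome length.

    n_reads counts individual reads, so each read pair contributes two.
    """
    if genome_length_bp <= 0:
        raise ValueError("genome_length_bp must be positive")
    return n_reads * read_length_bp / genome_length_bp


# Hypothetical inputs (not taken from the paper): 6.3 million reads of
# 100 bp against a 2.0-Mb genome.
depth = fold_coverage(n_reads=6_300_000, read_length_bp=100,
                      genome_length_bp=2_000_000)
print(f"{depth:.0f}x")
```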

We obtained 16 and 17 contigs larger than 500 bp for
PKUSS-HG01 and PKUSS-LG01, respectively. The genomic sequence of
PKUSS-HG01 comprises 1,951 coding sequences (CDSs), 3 rRNA
loci, and 43 tRNAs, with a G+C content of 36.6%. In comparison,
the genome sequence of PKUSS-LG01 contains 1,935 CDSs, 3
rRNA loci, and 39 tRNAs, with a G+C content of 36.6%.
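
Summary statistics of this kind (number of contigs above a length cutoff, pooled G+C content) can be recomputed from an assembly FASTA file. The following is a minimal sketch, not the authors' procedure: the function names `read_fasta` and `assembly_summary` are hypothetical, and the choice to compute G+C over unambiguous A/C/G/T bases only is an assumption.

```python
from io import StringIO


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text handle."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def assembly_summary(handle, min_len=500):
    """Count contigs longer than min_len and return their pooled G+C percent."""
    n_contigs = gc = acgt = 0
    for _, seq in read_fasta(handle):
        if len(seq) <= min_len:
            continue  # keep only contigs strictly larger than the cutoff
        seq = seq.upper()
        n_contigs += 1
        gc += seq.count("G") + seq.count("C")
        # Assumption: ambiguous bases (e.g. N) are excluded from the denominator.
        acgt += sum(seq.count(base) for base in "ACGT")
    gc_percent = 100.0 * gc / acgt if acgt else 0.0
    return n_contigs, gc_percent


# Toy input with a small cutoff so the example stays readable.
toy = StringIO(">contig1\nGGCC\nAATT\n>contig2\nAT\n")
print(assembly_summary(toy, min_len=4))
```

For a real assembly, an open file handle would be passed in place of the `StringIO` object, with `min_len=500` to match the cutoff used above.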

Totals of 1,606 and 1,598 CDSs were assigned to COGs, and
1,064 and 1,057 CDSs could be annotated into 146 and 147 pathways
by using KAAS (14) for PKUSS-HG01 and PKUSS-LG01,
respectively. All-to-all BLASTP analysis with PKUSS-HG01 and
PKUSS-LG01 protein sequences showed that they share 1,865
orthologous proteins. These genome sequences will enable further study
of the mechanisms of pathogenicity and evolution and facilitate
future drug development.
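
The criterion used to call orthologs from the all-to-all BLASTP results is not stated here. One common choice is reciprocal best hits, sketched below under that assumption. The input is assumed to be BLAST+ tabular output (`-outfmt 6`, default 12 columns, bit score in the last column); the function names and the toy protein identifiers are hypothetical.

```python
def best_hits(rows):
    """Map each query ID to the subject ID with the highest bit score.

    rows: iterable of tab-separated lines in default BLAST+ outfmt 6 order
    (qseqid, sseqid, ..., evalue, bitscore).
    """
    best = {}
    for row in rows:
        fields = row.rstrip("\n").split("\t")
        if len(fields) < 12:
            continue  # skip blank or malformed lines
        query, subject, bitscore = fields[0], fields[1], float(fields[11])
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (a, b) pairs that are each other's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Toy rows with hypothetical protein IDs; only columns 1, 2, and 12 matter here.
def row(query, subject, bitscore):
    return "\t".join([query, subject, "99.0", "100", "1", "0",
                      "1", "100", "1", "100", "1e-50", str(bitscore)])

hg_vs_lg = [row("HG_0001", "LG_0001", 200), row("HG_0001", "LG_0002", 90),
            row("HG_0002", "LG_0002", 50)]
lg_vs_hg = [row("LG_0001", "HG_0001", 198), row("LG_0002", "HG_0001", 88)]
print(reciprocal_best_hits(hg_vs_lg, lg_vs_hg))
```

In practice an E-value or identity threshold would usually be applied before taking best hits; that filter is omitted here because no threshold is given in the text.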

Nucleotide sequence accession numbers. These whole-genome
shotgun projects have been deposited at DDBJ/EMBL/GenBank
under the accession numbers AXSW00000000 for strain
PKUSS-HG01 and AXSX00000000 for strain PKUSS-LG01. The
versions described in this paper are versions AXSW01000000 and
AXSX01000000.